(1) GENERAL INFORMATION:
(i) APPLICANT: SHANGHAI SECOND MEDICAL UNIVERSITY

(ii) TITLE OF THE INVENTION: A HUMAN HSG.III GENE 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ratner & Prestia

(B) STREET: P.O. Box 980 

(C) CITY: Valley Forge 

(D) STATE: PA

(E) COUNTRY: USA

(F) ZIP: 19482

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ for Windows Version 2.0

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: TO BE ASSIGNED

(B) FILING DATE:

(C) CLASSIFICATION: UNKNOWN

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Prestia, Paul F

(B) REGISTRATION NUMBER: 23,031

(C) REFERENCE/DOCKET NUMBER: GF-70526

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-407-0700

(B) TELEFAX: 610-407-0700

(C) TELEX: 846169 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2017 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TAAAGCTACG CCCTGGCCGC AGTCTCCGCG TCACAGGAAC TTCAGCACCC AGAGGGCGGA 60

CAGCGCTCCC CTCTACCTGG AGACTTGACT CCCGCGCGCC CCAACCCTGC TTATCCCTTG 120

ACCGTCGAGT GTCAGAGATC CTGCAGCCGC CCAGTCCCGG CCCCTCTCCC GCCCCACACC 180

CACCCTCCTG GCTCTTCCTG TTTTTACTCC TCCTTTTCAT TCATAACAAA AGCTACAGCT 240

CCAGGAGCCC AGCGCCGGGC TGTGACCCAA GCCGAGCGTG GAAGAATGGG GTTCCTCGGG 300

ACCGGCACTT GGATTCTGGT GTTAGTGCTC CCGATTCAAG CTTTCCCCAA ACCTGGAGGA 360

AGCCAAGACA AATCTCTACA TAATAGAGAA TTAAGTGCAG AAAGACCTTT GAATGAACAG 420

ATTGCTGAAG CAGAAGAAGA CAAGATTAAA AAAACATATC CTCCAGAAAA CAAGCCAGGT 480

CAGAGCAACT ATTCTTTTGT TGATAACTTG AACCTGCTAA GGGCAATAAC AGAAAAGGAA 540

AAAATTGAGA AAGAAAGACA ATCTATAAGA AGCTCCCCAC TTGATAATAA GTTGAATGTG 600

GAAGATGTTG ATTCAACCAA GAATCGAAAA CTGATCGATG ATTATGACTC TACTAAGAGT 660

GGATTGGATC ATAAATTTCA AGATGATCCA GATGGTCTTC ATCAACTAGA CGGGACTCCT 720

TTAACCGCTG AAGACATTGT CCATAAAATC GCTGCCAGGA TTTATGAAGA AAATGACAGA 780

GCCGTGTTTG ACAAGATTGT TTCTAAACTA CTTAATCTCG GCCTTATCAC AGAAAGCCAA 840

GCACATACAC TGGAAGATGA AGTAGCAGAG GTTTTACAAA AATTAATCTC AAAGGAAGCC 900

AACAATTATG AGGAGGATCC CAATAAGCCC ACAAGCTGGA CTGAGAATCA GGCTGGAAAA 960

ATACCAGAGA AAGTGACTCC AATGGCAGCA ATTCAAGATG GTCTTGCTAA GGGAGAAAAC 1020

GATGAAACAG TATCTAACAC ATTAACCTTG ACAAATGGCT TGGAAAGGAG AACTAAAACC 1080

TACAGTGAAG ACAACTTTAG GGACTTCCAA TATTTCCCAA ATTTCTATGC GCTACTGAAA 1140

AGTATTGATT CAGAAAAAGA AGCAAAAGAG AAAGAAACAC TGATTACTAT CATGAAAACA 1200

CTGATTGACT TTGTGAAGAT GATGGTGAAA TATGGAACAA TATCTCCAGA AGAAGGTGTT 1260

TCCTACCTTG AAAACTTGGA TGAAATGATT GCTCTTCAGA CCAAAAACAA GCTAGAAAAA 1320

AATGCTACTG ACAATATAAG CAAGCTTTTC CCAGCACCAT CAGAGAAGAG TCATGAAGAA 1380

ACAGACAGTA CCAAGGAAGA AGCAGCTAAG ATGGAAAAGG AATATGGAAG CTTGAAGGAT 1440

TCCACAAAAG ATGATAACTC CAACCCAGGA GGAAAGACAG ATGAACCCAA AGGAAAAACA 1500

GAAGCCTATT TGGAAGCCAT CAGAAAAAAT ATTGAATGGT TGAAGAAACA TGACAAAAAG 1560

GGAAATAAAG AAGATTATGA CCTTTCAAAG ATGAGAGACT TCATCAATAA ACAAGCTGAT 1620

GCTTATGTGG AGAAAGGCAT CCTTGACAAG GAAGAAGCCG AGGCCATCAA GCGCATTTAT 1680

AGCAGCCTGT AAAAATGGCA AAAGATCCAG GAGTCTTTCA ACTGTTTCAG AAAACATAAT 1740

ATAGCTTAAA ACACTTCTAA TTCTGTGATT AAAATTTTTT GACCCAAGGG TTATTAGAAA 1800

GTGCTGAATT TACAGTAGTT AACCTTTTAC AAGTGGTTAA AACATAGCTT TCTTCCCGTA 1860

AAAACTATCT GAAAGTAAAG TTGTATGTAA GCTGAGATTT TGTATACAGG AATCCTTATT 1920

TCCTCATAGN CTTATTATTT TATAATCAGG AATATGTTGC TTTGGAAAAA GCCTCTTAAT 1980

GGGCTGACCN TAAAAACTCA ATCCNTCTTC CACTGTC 2017
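
As a cross-check on the rows of SEQ ID NO:1, a minimal Python sketch (not part of the original filing) can rejoin the ten-base blocks and confirm that the number at the end of each row equals the running residue count. The function name `join_numbered_rows` and the set of accepted symbols (A, C, G, T, N) are assumptions made for illustration only.

```python
import re

def join_numbered_rows(rows, start=0):
    """Concatenate blocked sequence rows, validating each row's running count.

    Each row is expected to look like 'BLOCK BLOCK ... COUNT', where COUNT is
    the cumulative number of residues at the end of that row.
    """
    pieces = []
    total = start
    for row in rows:
        tokens = row.split()
        if not tokens:
            continue  # skip blank spacer lines
        *blocks, count = tokens
        residues = "".join(blocks).upper()
        # Assumed alphabet: the four bases plus N for an undetermined base.
        if not re.fullmatch(r"[ACGTN]+", residues):
            raise ValueError(f"unexpected symbol in row ending at {count}")
        total += len(residues)
        if total != int(count):
            raise ValueError(f"row claims {count} residues, counted {total}")
        pieces.append(residues)
    return "".join(pieces)

# First two rows and the final row of SEQ ID NO:1, copied from the listing.
head = join_numbered_rows([
    "TAAAGCTACG CCCTGGCCGC AGTCTCCGCG TCACAGGAAC TTCAGCACCC AGAGGGCGGA 60",
    "",
    "CAGCGCTCCC CTCTACCTGG AGACTTGACT CCCGCGCGCC CCAACCCTGC TTATCCCTTG 120",
])
tail = join_numbered_rows(
    ["GGGCTGACCN TAAAAACTCA ATCCNTCTTC CACTGTC 2017"], start=1980
)
print(len(head), len(tail))
```

A row whose trailing count disagrees with the number of residues actually present raises `ValueError`, which is one way to catch a mis-transcribed block or count before the sequence is used.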

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Gly Phe Leu Gly Thr Gly Thr Trp Ile Leu Val Leu Val Leu Pro
1 5 10 15

Ile Gln Ala Phe Pro Lys Pro Gly Gly Ser Gln Asp Lys Ser Leu His
20 25 30

Asn Arg Glu Leu Ser Ala Glu Arg Pro Leu Asn Glu Gln Ile Ala Glu
35 40 45

Ala Glu Glu Asp Lys Ile Lys Lys Thr Tyr Pro Pro Glu Asn Lys Pro
50 55 60

Gly Gln Ser Asn Tyr Ser Phe Val Asp Asn Leu Asn Leu Leu Arg Ala
65 70 75 80

Ile Thr Glu Lys Glu Lys Ile Glu Lys Glu Arg Gln Ser Ile Arg Ser
85 90 95

Ser Pro Leu Asp Asn Lys Leu Asn Val Glu Asp Val Asp Ser Thr Lys
100 105 110

Asn Arg Lys Leu Ile Asp Asp Tyr Asp Ser Thr Lys Ser Gly Leu Asp
115 120 125

His Lys Phe Gln Asp Asp Pro Asp Gly Leu His Gln Leu Asp Gly Thr
130 135 140

Pro Leu Thr Ala Glu Asp Ile Val His Lys Ile Ala Ala Arg Ile Tyr
145 150 155 160

Glu Glu Asn Asp Arg Ala Val Phe Asp Lys Ile Val Ser Lys Leu Leu
165 170 175

Asn Leu Gly Leu Ile Thr Glu Ser Gln Ala His Thr Leu Glu Asp Glu
180 185 190

Val Ala Glu Val Leu Gln Lys Leu Ile Ser Lys Glu Ala Asn Asn Tyr
195 200 205

Glu Glu Asp Pro Asn Lys Pro Thr Ser Trp Thr Glu Asn Gln Ala Gly
210 215 220

Lys Ile Pro Glu Lys Val Thr Pro Met Ala Ala Ile Gln Asp Gly Leu
225 230 235 240

Ala Lys Gly Glu Asn Asp Glu Thr Val Ser Asn Thr Leu Thr Leu Thr
245 250 255

Asn Gly Leu Glu Arg Arg Thr Lys Thr Tyr Ser Glu Asp Asn Phe Arg
260 265 270

Asp Phe Gln Tyr Phe Pro Asn Phe Tyr Ala Leu Leu Lys Ser Ile Asp
275 280 285

Ser Glu Lys Glu Ala Lys Glu Lys Glu Thr Leu Ile Thr Ile Met Lys
290 295 300

Thr Leu Ile Asp Phe Val Lys Met Met Val Lys Tyr Gly Thr Ile Ser
305 310 315 320

Pro Glu Glu Gly Val Ser Tyr Leu Glu Asn Leu Asp Glu Met Ile Ala
325 330 335

Leu Gln Thr Lys Asn Lys Leu Glu Lys Asn Ala Thr Asp Asn Ile Ser
340 345 350

Lys Leu Phe Pro Ala Pro Ser Glu Lys Ser His Glu Glu Thr Asp Ser
355 360 365

Thr Lys Glu Glu Ala Ala Lys Met Glu Lys Glu Tyr Gly Ser Leu Lys
370 375 380

Asp Ser Thr Lys Asp Asp Asn Ser Asn Pro Gly Gly Lys Thr Asp Glu
385 390 395 400

Pro Lys Gly Lys Thr Glu Ala Tyr Leu Glu Ala Ile Arg Lys Asn Ile
405 410 415

Glu Trp Leu Lys Lys His Asp Lys Lys Gly Asn Lys Glu Asp Tyr Asp
420 425 430

Leu Ser Lys Met Arg Asp Phe Ile Asn Lys Gln Ala Asp Ala Tyr Val
435 440 445

Glu Lys Gly Ile Leu Asp Lys Glu Glu Ala Glu Ala Ile Lys Arg Ile
450 455 460

Tyr Ser Ser Leu
465
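
The relationship between the two sequences can be illustrated with a short Python sketch, offered as an assumption-laden check rather than as part of the filing. It translates a DNA fragment with the standard genetic code and converts three-letter residue codes to one-letter codes so the two can be compared. The names `translate` and `three_to_one` are hypothetical, and the fragment coordinates (bases 286 to 300 for the start of the reading frame, bases 1681 to 1692 for its end) are our own count from the SEQ ID NO:1 rows, not a statement made in the source.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons containing N or other symbols are reported as X.
        residue = CODON_TABLE.get(dna[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def three_to_one(text):
    """Convert space-separated three-letter residue codes to one-letter codes."""
    return "".join(THREE_TO_ONE[code] for code in text.split())

# Assumed coordinates: bases 286-300 and 1681-1692 of SEQ ID NO:1.
orf_start = "ATGGGGTTCCTCGGG"
orf_end = "AGCAGCCTGTAA"

print(translate(orf_start) == three_to_one("Met Gly Phe Leu Gly"))
print(translate(orf_end) == three_to_one("Ser Ser Leu"))
```

Under these assumptions, a reading frame from base 286 through base 1689 spans 1404 bases, or 468 codons, which agrees with the stated length of SEQ ID NO:2.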
SEQUENCE INFORMATION
SEQ ID NO:1

TAAAGCTACGCCCTGGCCGCAGTCTCCGCGTCACAGGAACTTCAGCACCCAGAGGGCGGA
CAGCGCTCCCCTCTACCTGGAGACTTGACTCCCGCGCGCCCCAACCCTGCTTATCCCTTG
ACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCCCGGCCCCTCTCCCGCCCCACACC
CACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATAACAAAAGCTACAGCT
CCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTTCCTCGGG
ACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGA
AGCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAG
ATTGCTGAAGCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGT
CAGAGCAACTATTCTTTTGTTGATAACTTGAACCTGCTAAGGGCAATAACAGAAAAGGAA
AAAATTGAGAAAGAAAGACAATCTATAAGAAGCTCCCCACTTGATAATAAGTTGAATGTG
GAAGATGTTGATTCAACCAAGAATCGAAAACTGATCGATGATTATGACTCTACTAAGAGT
GGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCATCAACTAGACGGGACTCCT
TTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAGAAAATGACAGA
GCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGCCAA
GCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCC
AACAATTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAA
ATACCAGAGAAAGTGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAAC
GATGAAACAGTATCTAACACATTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACC
TACAGTGAAGACAACTTTAGGGACTTCCAATATTTCCCAAATTTCTATGCGCTACTGAAA
AGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAACACTGATTACTATCATGAAAACA
CTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCTCCAGAAGAAGGTGTT
TCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCTAGAAAAA
AATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAA
ACAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGAT
TCCACAAAAGATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACA
GAAGCCTATTTGGAAGCCATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAG
GGAAATAAAGAAGATTATGACCTTTCAAAGATGAGAGACTTCATCAATAAACAAGCTGAT
GCTTATGTGGAGAAAGGCATCCTTGACAAGGAAGAAGCCGAGGCCATCAAGCGCATTTAT
AGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAACTGTTTCAGAAAACATAAT
ATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGGGTTATTAGAAA
GTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCCGTA
AAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAGATTTTGTATACAGGAATCCTTATT
TCCTCATAGNCTTATTATTTTATAATCAGGAATATGTTGCTTTGGAAAAAGCCTCTTAAT
GGGCTGACCNTAAAAACTCAATCCNTCTTCCACTGTC

SEQ ID NO:2

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPP
ENKPGQSNYSFVDNLNLLRAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDY
DSTKSGLDHKFQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGL
ITESQAHTLEDEVAEVLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGL
AKGENDETVSNTLTLTNGLERRTKTYSEDNFRDFQYFPNFYALLKSIDSEKEAKEKETLI
TIMKTLIDFVKMMVKYGTISPEEGVSYLENLDEMIALQTKNKLEKNATDNISKLFPAPSE
KSHEETDSTKEEAAKMEKEYGSLKDSTKDDNSNPGGKTDEPKGKTEAYLEAIRKNIEWLK
KHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDKEEAEAIKRIYSSL